A Novel Approach to Clustering and Assembly of Large-Scale Roche 454 Transcriptome Data for Gene Validation and Alternative Splicing Analysis

Authors

  • Vitoantonio Bevilacqua
  • Fabio Stroppa
  • Stefano Saladino
  • Ernesto Picardi
Abstract

In this paper we propose a new implementation of EasyCluster, a robust software tool developed to generate reliable clusters of expressed sequence tags (ESTs). Such clusters can be used to infer and improve gene structures and to discover potential alternative splicing events. The new version of EasyCluster can manage genome-scale transcriptome data produced by massive sequencing on Roche 454 sequencers. Moreover, it speeds up the creation of gene-oriented clusters and facilitates downstream analyses such as the assembly of full-length transcripts. Finally, available annotations can now be used to improve the overall clustering procedure. The new EasyCluster implementation also embeds a graphical browser that provides an overview of results at the genome level, simplifying the interpretation of findings for researchers with no specific skills in bioinformatics.

Similar resources

An improved procedure for clustering and assembly of large transcriptome data

Motivations: Expressed sequence tags and full-length cDNAs represent an invaluable source of evidence for inferring reliable gene structures and discovering potential alternative splicing events [1]. However, to fully exploit their biological potential, correct and reliable EST clusters are required. To fill this gap we developed the program EasyCluster, which proved the most accurate when compa...

Poly (A)+ Transcriptome Assessment of ERBB2-Induced Alterations in Breast Cell Lines

We report the first quantitative and qualitative analysis of the poly (A)⁺ transcriptome of two human mammary cell lines differentially expressing ERBB2 (human epidermal growth factor receptor 2), an oncogene over-expressed in approximately 25% of human breast tumors. Full-length cDNA populations from the two cell lines were digested enzymatically, individually tagged according to a customized method f...

Clustering of Short Read Sequences for de novo Transcriptome Assembly

Given the importance of transcriptome analysis in various biological studies and considering the vast amount of whole transcriptome sequencing data, it seems necessary to develop an algorithm to assemble transcriptome data. In this study we propose an algorithm for transcriptome assembly in the absence of a reference genome. First, the contiguous sequences are generated using de Bruijn graph with d...

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

Recently, technologies for analyzing DNA sequence variation and profiling gene expression have been widely used in genome biology and genetics. Their application to genome studies has advanced particularly with the introduction of the next-generation DNA sequencer (NGS) Roche/454 and Illumina/Solexa systems, along with bioinformatic analysis technologies of whole-geno...

Comparative Analysis of Kabuli Chickpea Transcriptome with Desi and Wild Chickpea Provides a Rich Resource for Development of Functional Markers

Chickpea (Cicer arietinum L.) is an important crop legume plant with high nutritional value. The transcriptomes of desi and wild chickpea have already been sequenced. In this study, we sequenced the transcriptome of kabuli chickpea, C. arietinum (genotype ICCV2), having higher commercial value, using GS-FLX Roche 454 and Illumina technologies. The assemblies of both Roche 454 and Illumina datas...

Journal title:

Volume   Issue

Pages  -

Publication date: 2011